1,631 research outputs found

    Development and application of software and algorithms for network approaches to proteomics data analysis

    Get PDF
    The cells making up all living organisms integrate external and internal signals to carry out the functions of life. Dysregulation of signaling can lead to a variety of grave diseases, including cancer [Slamon et al., 1987]. In order to understand signal transduction, one has to identify and characterize the main constituents of cellular signaling cascades. Proteins are involved in most cellular processes and form the major class of biomolecules responsible for signal transduction. Post-translational modifications (PTMs) of proteins can modulate their enzymatic activity and their protein-protein interactions (PPIs) which in turn can ultimately lead to changes in protein expression. Classical biochemistry has approached the study of proteins, PTMs and interaction from a reductionist view. The abundance, stability and localization of proteins was studied one protein at a time, following the one gene-one protein-one function paradigm [Beadle and Tatum, 1941]. Pathways were considered to be linear, where signals would be transmitted from a gene to proteins, eventually resulting in a specific phenotype. Establishing the crucial link between genotype and phenotype remains challenging despite great advances in omics technologies, such as liquid chromatography (LC)-mass spectrometry (MS) that allow for the system-wide interrogation of proteins. Systems and network biology [Barabási and Oltvai, 2004, Bensimon et al., 2012, Jørgensen and Locard-Paulet, 2012, Choudhary and Mann, 2010] aims to transform modern biology by utilizing omics technologies to understand and uncover the various complex networks that govern the cell. The first detected large-scale biological networks have been found to be highly structured and non-random [Albert and Barabási, 2002]. Furthermore, these are assembled from functional and topological modules. The smallest topological modules are formed by the direct physical interactions within protein-protein and protein-RNA complexes. These molecular machines are able to perform a diverse array of cellular functions, such as transcription and degradation [Alberts, 1998]. Members of functional modules are not required to have a direct physical interaction. Instead, such modules also include proteins with temporal co-regulation throughout the cell cycle [Olsen et al., 2010], or following the circadian day-night rhythm [Robles et al., 2014]. The signaling pathways that make up the cellular network [Jordan et al., 2000] are assembled from a hierarchy of these smaller modules [Barabási and Oltvai, 2004]. The regulation of these modules through dynamic rewiring enables the cell to respond to internal an external stimuli. The main challenge in network biology is to develop techniques to probe the topology of various biological networks, to identify topological and functional modules, and to understand their assembly and dynamic rewiring. LC-MS has become a powerful experimental platform that addresses all these challenges directly [Bensimon et al., 2012], and has long been used to study a wide range of biomolecules that participate in the cellular network. The field of proteomics in particular, which is concerned with the identification and characterization of the proteins in the cell, has been revolutionized by recent technological advances in MS. Proteomics experiments are used not only to quantify peptides and proteins, but also to uncover the edges of the cellular network, by screening for physical PPIs in a global [Hein et al., 2015] or condition specific manner [Kloet et al., 2016]. Crucial for the interpretation of the large-scale data generated by MS experiments is the development of software tools that aid researchers in translating raw measurements into biological insights. The MaxQuant and Perseus platforms were designed for this exact purpose. The aim of this thesis was to develop software tools for the analysis of MS-based proteomics data with a focus on network biology and apply the developed tools to study cellular signaling. The first step was the extension of the Perseus software with network data structures and activities. The new network module allows for the sideby-side analysis of matrices and networks inside an interactive workflow and is described in article 1. We subsequently apply the newly developed software to study the circadian phosphoproteome of cortical synapses (see article 2). In parallel we aimed to improve the analysis of large datasets by adapting the previously Windows-only MaxQuant software to the Linux operating system, which is more prevalent in high performance computing environments (see article 3)

    A network module for the perseus software for computational proteomics facilitates proteome interaction graph analysis

    Get PDF
    Proteomics data analysis strongly benefits from not studying single proteins in isolation but taking their multivariate interdependence into account. We introduce PerseusNet, the new Perseus network module for the biological analysis of proteomics data. Proteomics is commonly used to generate networks, e.g., with affinity purification experiments, but networks are also used to explore proteomics data. PerseusNet supports the biomedical researcher for both modes of data analysis with a multitude of activities. For affinity purification, a volcano-plot-based statistical analysis method for network generation is featured which is scalable to large numbers of baits. For posttranslational modifications of proteins, such as phosphorylation, a collection of dedicated network analysis tools helps in elucidating cellular signaling events. Co-expression network analysis of proteomics data adopts established tools from transcriptome co-expression analysis. PerseusNet is extensible through a plugin architecture in a multi-lingual way, integrating analyses in C#, Python, and R, and is freely available at http://www.perseus-framework.org.publishedVersio

    Polyether-tethered imidazole-2-thiones, imidazole-2-selenones and imidazolium salts as collectors for the flotation of lithium aluminate and spodumene

    Get PDF
    Imidazolium salts were prepared which possess 2-ethoxyethyl pivalate or 2-(2-ethoxyethoxy)ethyl pivalate groups as amphiphilic side chains with oxygen donors as well as n-butyl substituents as hydrophobic groups. The N-heterocyclic carbenes of the salts, characterized by 7 Li and 13 C NMR spectroscopy as well as by Rh and Ir complex formation, were used as starting materials for the preparation of the corresponding imidazole-2-thiones and imidazole-2-selenones. Flotation experiments in Hallimond tubes under variation of the air flow, pH, concentration and flotation time were performed. The title compounds proved to be suitable collectors for the flotation of lithium aluminate and spodumene for lithium recovery. Recovery rates up to 88.9% were obtained when the imidazole-2-thione was used as collector

    Multifocal high-grade glioma radiotherapy safety and efficacy

    Get PDF
    BACKGROUND Multifocal manifestation of high-grade glioma is a rare disease with very unfavourable prognosis. The pathogenesis of multifocal glioma and pathophysiological differences to unifocal glioma are not fully understood. The optimal treatment of patients suffering from multifocal high-grade glioma is not defined in the current guidelines, therefore individual case series may be helpful as guidance for clinical decision-making. METHODS Patients with multifocal high-grade glioma treated with conventionally fractionated radiation therapy (RT) in our institution with or without concomitant chemotherapy between April 2011 and April 2019 were retrospectively analysed. Multifocality was neuroradiologically assessed and defined as at least two independent contrast-enhancing foci in the MRI T1 contrast-enhanced sequence. IDH mutational status and MGMT methylation status were assessed from histopathology records. GTV, PTV as well as the V30Gy, V45Gy and D2% volumes of the brain were analysed. Overall and progression-free survival were calculated from the diagnosis until death and from start of radiation therapy until diagnosis of progression of disease in MRI for all patients. RESULTS 20 multifocal glioma cases (18 IDH wild-type glioblastoma cases, one diffuse astrocytic glioma, IDH wild-type case with molecular features of glioblastoma and one anaplastic astrocytoma, IDH wild-type case) were included into the analysis. Resection was performed in two cases and stereotactic biopsy only in 18 cases before the start of radiation therapy. At the start of radiation therapy patients were 61~years old in median (range 42-84~years). Histopathological examination showed IDH wild-type in all cases and MGMT promotor methylation in 11 cases (55%). Prescription schedules were 60~Gy (2~Gy × 30), 59.4~Gy (1.8~Gy × 33), 55~Gy (2.2~Gy × 25) and 50~Gy (2.5~Gy × 20) in 15, three, one and one cases, respectively. Concomitant temozolomide chemotherapy was applied in 16 cases, combined temozolomide/lomustine chemotherapy was applied in one case and concomitant bevacizumab therapy in one case. Median number of GTVs was three. Median volume of the sum of the GTVs was 26 cm3. Median volume of the PTV was 425.7 cm3 and median PTV to brain ratio 32.8 percent. Median D2% of the brain was 61.5~Gy (range 51.2-62.7) and median V30Gy and V45 of the brain were 59.9 percent (range 33-79.7) and 40.7 percent (range 14.9-64.1), respectively. Median survival was eight months (95% KI 3.6-12.4~months) and median progression free survival after initiation of RT five months (95% CI 2.8-7.2~months). Grade 2 toxicities were detected in eight cases and grade 3 toxicities in four cases consisting of increasing edema in three cases and one new-onset seizure. One grade 4 toxicity was detected, which was febrile neutropenia related to concomitant chemotherapy. CONCLUSION Conventionally fractionated RT with concomitant chemotherapy could safely be applied in multifocal high-grade glioma in this case series despite large irradiation treatment fields

    MaxDIA enables library-based and library-free data-independent acquisition proteomics

    Get PDF
    MaxDIA is a software platform for analyzing data-independent acquisition (DIA) proteomics data within the MaxQuant software environment. Using spectral libraries, MaxDIA achieves deep proteome coverage with substantially better coefficients of variation in protein quantification than other software. MaxDIA is equipped with accurate false discovery rate (FDR) estimates on both library-to-DIA match and protein levels, including when using whole-proteome predicted spectral libraries. This is the foundation of discovery DIA—hypothesis-free analysis of DIA samples without library and with reliable FDR control. MaxDIA performs three- or four-dimensional feature detection of fragment data, and scoring of matches is augmented by machine learning on the features of an identification. MaxDIA’s bootstrap DIA workflow performs multiple rounds of matching with increasing quality of recalibration and stringency of matching to the library. Combining MaxDIA with two new technologies—BoxCar acquisition and trapped ion mobility spectrometry—both lead to deep and accurate proteome quantification.publishedVersio

    UBVRI Light Curves of 44 Type Ia Supernovae

    Get PDF
    We present UBVRI photometry of 44 type-Ia supernovae (SN Ia) observed from 1997 to 2001 as part of a continuing monitoring campaign at the Fred Lawrence Whipple Observatory of the Harvard-Smithsonian Center for Astrophysics. The data set comprises 2190 observations and is the largest homogeneously observed and reduced sample of SN Ia to date, nearly doubling the number of well-observed, nearby SN Ia with published multicolor CCD light curves. The large sample of U-band photometry is a unique addition, with important connections to SN Ia observed at high redshift. The decline rate of SN Ia U-band light curves correlates well with the decline rate in other bands, as does the U-B color at maximum light. However, the U-band peak magnitudes show an increased dispersion relative to other bands even after accounting for extinction and decline rate, amounting to an additional ~40% intrinsic scatter compared to B-band.Comment: 84 authors, 71 pages, 51 tables, 10 figures. Accepted for publication in the Astronomical Journal. Version with high-res figures and electronic data at http://astron.berkeley.edu/~saurabh/cfa2snIa

    Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche.

    Get PDF
    Age at menarche is a marker of timing of puberty in females. It varies widely between individuals, is a heritable trait and is associated with risks for obesity, type 2 diabetes, cardiovascular disease, breast cancer and all-cause mortality. Studies of rare human disorders of puberty and animal models point to a complex hypothalamic-pituitary-hormonal regulation, but the mechanisms that determine pubertal timing and underlie its links to disease risk remain unclear. Here, using genome-wide and custom-genotyping arrays in up to 182,416 women of European descent from 57 studies, we found robust evidence (P < 5 × 10(-8)) for 123 signals at 106 genomic loci associated with age at menarche. Many loci were associated with other pubertal traits in both sexes, and there was substantial overlap with genes implicated in body mass index and various diseases, including rare disorders of puberty. Menarche signals were enriched in imprinted regions, with three loci (DLK1-WDR25, MKRN3-MAGEL2 and KCNK9) demonstrating parent-of-origin-specific associations concordant with known parental expression patterns. Pathway analyses implicated nuclear hormone receptors, particularly retinoic acid and γ-aminobutyric acid-B2 receptor signalling, among novel mechanisms that regulate pubertal timing in humans. Our findings suggest a genetic architecture involving at least hundreds of common variants in the coordinated timing of the pubertal transition

    BRCA2 polymorphic stop codon K3326X and the risk of breast, prostate, and ovarian cancers

    Get PDF
    Background: The K3326X variant in BRCA2 (BRCA2*c.9976A&gt;T; p.Lys3326*; rs11571833) has been found to be associated with small increased risks of breast cancer. However, it is not clear to what extent linkage disequilibrium with fully pathogenic mutations might account for this association. There is scant information about the effect of K3326X in other hormone-related cancers. Methods: Using weighted logistic regression, we analyzed data from the large iCOGS study including 76 637 cancer case patients and 83 796 control patients to estimate odds ratios (ORw) and 95% confidence intervals (CIs) for K3326X variant carriers in relation to breast, ovarian, and prostate cancer risks, with weights defined as probability of not having a pathogenic BRCA2 variant. Using Cox proportional hazards modeling, we also examined the associations of K3326X with breast and ovarian cancer risks among 7183 BRCA1 variant carriers. All statistical tests were two-sided. Results: The K3326X variant was associated with breast (ORw = 1.28, 95% CI = 1.17 to 1.40, P = 5.9x10- 6) and invasive ovarian cancer (ORw = 1.26, 95% CI = 1.10 to 1.43, P = 3.8x10-3). These associations were stronger for serous ovarian cancer and for estrogen receptor–negative breast cancer (ORw = 1.46, 95% CI = 1.2 to 1.70, P = 3.4x10-5 and ORw = 1.50, 95% CI = 1.28 to 1.76, P = 4.1x10-5, respectively). For BRCA1 mutation carriers, there was a statistically significant inverse association of the K3326X variant with risk of ovarian cancer (HR = 0.43, 95% CI = 0.22 to 0.84, P = .013) but no association with breast cancer. No association with prostate cancer was observed. Conclusions: Our study provides evidence that the K3326X variant is associated with risk of developing breast and ovarian cancers independent of other pathogenic variants in BRCA2. Further studies are needed to determine the biological mechanism of action responsible for these associations

    Integration of clinical parameters and CT-based radiomics improves machine learning assisted subtyping of primary hyperaldosteronism

    Get PDF
    ObjectivesThe aim of this study was to investigate an integrated diagnostics approach for prediction of the source of aldosterone overproduction in primary hyperaldosteronism (PA).Methods269 patients from the prospective German Conn Registry with PA were included in this study. After segmentation of adrenal glands in native CT images, radiomic features were calculated. The study population consisted of a training (n = 215) and a validation (n = 54) cohort. The k = 25 best radiomic features, selected using maximum-relevance minimum-redundancy (MRMR) feature selection, were used to train a baseline random forest model to predict the result of AVS from imaging alone. In a second step, clinical parameters were integrated. Model performance was assessed via area under the receiver operating characteristic curve (ROC AUC). Permutation feature importance was used to assess the predictive value of selected features.ResultsRadiomics features alone allowed only for moderate discrimination of the location of aldosterone overproduction with a ROC AUC of 0.57 for unilateral left (UL), 0.61 for unilateral right (UR), and 0.50 for bilateral (BI) aldosterone overproduction (total 0.56, 95% CI: 0.45-0.65). Integration of clinical parameters into the model substantially improved ROC AUC values (0.61 UL, 0.68 UR, and 0.73 for BI, total 0.67, 95% CI: 0.57-0.77). According to permutation feature importance, lowest potassium value at baseline and saline infusion test (SIT) were the two most important features.ConclusionIntegration of clinical parameters into a radiomics machine learning model improves prediction of the source of aldosterone overproduction and subtyping in patients with PA
    corecore